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Abstract. Let T : X — ► Y be a bounded linear map between Banach spaces 
X and Y. Let T* : Y* — > X* be its adjoint. Let Bx and By be the closed 
unit balls of X and Y* respectively. We obtain apparently new estimates for 
the covering numbers of the set T* (By*)- These are expressed in terms of 
the covering numbers of T(Bx), or, more generally, in terms of the covering 
numbers of a "significant" subset of T(Bx)- The latter more general esti- 
mates are best possible. These estimates follow from our new quantitative 
version of an abstract compactness result which generalizes classical theorems 
of Arzela-Ascoli and of Schauder. Analogous estimates also hold for the cov- 
ering numbers of T(Bx), in terms of the covering numbers of T* (By*) or in 
terms of a suitable "significant" subset of T* (By*). 

1. Introduction 

The main motivation for the main result of this note is to give quantitative 
versions of the celebrated Schauder theorem about adjoints of compact operators. 
In fact our versions also apply to operators which are not compact. 

When we hrst obtained these results it seemed hard to imagine that they are not 
already known. But so far we have not found any references to similar results in 
the literature. We invite the reader to inform us of any such references. In future 
versions of this note we hope to include a more extensive survey of related previous 
results. 

Of course there are remarkable recent results by Shiri Artstein-Avidan, Vitali 
Milman, Stanislaw Szarek and Nicole Tomczak-Jaegerman [TJ [2] which give es- 
timates for covering numbers which look much much stronger than the ones we 
shall give here. But they deal with a slightly different kind of question. Still more 
recently, Emanuel Milman [9] has obtained results which in many cases also give 
substantially better estimates than those given below in our Corollary [8] However 
his estimates contain factors and exponents which depend on the dimension of the 
underlying space, and ours do not, at least not in any explicit way. Moreover, 
we obtain a quantitative version of a natural variant of Schauder's Theorem, (see 
Corollary [9]) where our estimates are best possible. 

Our point of departure is a theorem which is general enough to contain the 
classical theorems of Arzel-Ascoli and of Schauder as special cases. But it, in turn, 
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can be considered as a special case of considerably more abstract and general results 
presented by Robert G. Bartle in [3], and which have their roots in earlier work of 
Kakutani, R. S. Phillips and Smulian. 

The said theorem appears in [7] as a prelude to other results dealing with finitely 
additive means and semigroups of operators. It also appears in [5], where it is used 
as a tool to study complex interpolation of compact operators. Each of us obtained 
the theorem independently, and before we became aware of the earlier and more 
general results of [3]. 

We will state the theorem in the same formulation as is used in [5]. Perhaps 
we first need to recall that a semimetric space (E, d) (also often referred to as a 
pseudometric space) is a set E equipped with a function d : E x E — > [0, oo) which 
is a semimetric, this meaning that it satisfies all the usual conditions for a metric, 
except that the condition d(x, y) — does not imply that x = y. The definition of 
a totally bounded semimetric space (which will not be explicitly needed anyway in 
the sequel) is exactly analogous to that of a totally bounded metric space. 

Theorem 1. (Cf. [3J [7],) Let A and B be two sets and let h : A x B — > C be a 
function with the properties that 

(1) sup \h(a, 6)| < oo for each fixed b £ B, and 

a£A 

(2) sup \ h(a, 6)| < oo for each fixed a £ A. 

b£B 

Define <i^(ai, 02) := sup bgB \h(ai, 6) — h(a2, 6)| for each pair of elements a\ and 
in A. 

Define ds(6i, 62) = sup a6 ^ \h(a, 61) — h(a, 62)! for each pair of elements 61 and 62 
in B. 
Then 

(3) [A, c2a) and (B,ds) are semimetric spaces 
and 

(4) (A, dji) is totally bounded if and only if(B,dg) is totally bounded. 

The claim ([3]) in the previous theorem is of course a trivial consequence of the 
definitions of c?^ and ds and we shall use it in the sequel here without further 
comment. 

There are proofs of Theorem [T] in both [5] and [7] . The advantage of the proof 
in [7] is that it can be adapted to give a quantitative version of the theorem, and 
therefore also of Schauder's Theorem. That is what we do in this note. 

Before we can state our new results we need to fix and discuss some more (es- 
sentially standard) notation and terminology: 

For each Banach space X we denote the closed unit ball of X by Bx- 

We use the standard notation of the "ceiling function", i.e., for each t g R we 
let \t\ denote the smallest integer which dominates t. 

Let (E, d) be a semimetric space. We wish to define balls in (E, d), the diameters 
of subsets of E, and two kinds of covering numbers for E. Our definitions will be 
mostly obvious variants of familiar ones for metric spaces. Covering numbers in 
metric spaces are often defined in terms of coverings by open balls of fixed given 
radius, (whose centres may or may not be required to be in some particular subset 
being considered). Here we will find it convenient to instead use coverings by "closed 
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balls" or arbitrary sets of fixed given diameter. We have permitted ourselves these 
slight "perturbations" of the usual definitions in order to enable the convenient 
formulation of examples showing that at least some of our results are best possible. 
Of course our results can easily be translated into results corresponding to covering 
numbers defined via coverings using open balls. 

For each subset G of E we define the diameter of G to (of course!) be the 
quantity 

diam(G) = sup d(x,y) . 

x,yeG 

For each x G E and each r > we refer to the sets {y G E : d(x, y) < r} and 
{y G E : d(x, y) < r} respectively as the open and closed balls of radius r centred 
at x. 

For each semimetric space (E, d) and each e > 0, the intrinsic covering number 
Ne{£) is the least positive integer n for which there exists a finite subset F C E of 
cardinality n such that 

(5) mind(a;, y) < e for each x G E , 

y£F 

i.e., Ne(c) is the smallest n such that E is contained in some union of n closed balls 
of radius e. If no such n G N exists, then Ne(c) = oo. 

We use the word "intrinsic" in this definition to reflect the fact that if E happens 
to be contained in a larger semimetric space and d is the restriction to E x E of the 
semimetric for that larger space, then we are requiring the centres of all the closed 
balls to be in E. This requirement will be relevant in the settings of Corollaries 
and [9] 

For each semimetric space (E, d) and each e > we define the diameter covering 
number N^(e) to be the smallest positive integer n for which there exist n subsets 

Ei, E2, , E n of E, each having diameter not exceeding 2e and for which E c 

\Jij—X Ej. If no such positive integer n exists, then iVjj (e) = 00. 

The triangle inequality for the semimetric d immediately gives us some simple 
connections between intrinsic covering numbers and diameter covering numbers, 
namely 



(6) N E (2e) < N£(e) < N E (e) for all e > . 

The naive hope that each arbitrary subset of E having diameter not exceeding 2e 
might be contained in a ball of radius e is immediately shattered by the following 
example: In R 2 with euclidean norm take E to be the interior of an equilateral 
triangle of side length 2e. However we can make the following simple observation, 
which will turn out to be relevant later in the setting where we will show that one 
of our results is best possible. 

Fact 2. Let m be a positive integer, let d(x, y) — \\x — y\\^ for each x, y G K m and 
let E be a subset o/R m . Let G be a subset of E with diam(G) < 2e. Then there 
exists a point x G R m such that G C {y G E : d(x, y) < e}. 

Proof. For each k = 1,2,...., to and each x G K m , let irk(x) denote the k th 
coordinate of x. We let — inf {%k(x) : x G G} and (3k — sup{7Tfc(a;) : x G G}. 
The finiteness of diam(G) ensures that both and (3k are finite. Then the bound on 

diam(G) ensures that < (3 k -a k < 2e. The point x = f 2 ^, , a "+ ;3m ) 

clearly has the required property. □ 
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Remark 3. Note that the point x in Fact [2] does not have to belong to G or even 
to E. If G happens to be convex and closed then we can have x € G for m < 2. 
But this need not happen for m > 3 as one can see by considering the set GcM 3 
which is the convex hull of (—1, 1, 1), (1, —1, 1) and (1, 1, —1). 



We are now ready to state our main result, and then its corollaries. Their proofs 
will be given in Section [5] 

Theorem 4. Let A and B be two sets and let h : A x B — > C be a function with 
the properties stated in Theorem^ Let dA and ds be the semimetrics defined on 
A and B respectively, as in Theorem [7J 

Suppose that the intrinsic covering number Nji(e) is finite for some positive num- 
ber e. Then 

(i) The quantity C := swp aeA beB \h(a, b)\ is also finite. 

(ii) The diameter covering number Ng(p) is finite for each p > e. 
(Hi) Furthermore, 



(7) 



N£(e + 6)< 



V2C 



2N A (e) 



for each 5 > . 



and, if h is real valued, the following stronger estimate also holds. 



(8) 



N£(e + 5)< 



r (71 \ N M e ) 



for each S > . 



(iv) By symmetry, the roles of A and B can be interchanged and so exactly 
analogous estimates hold for N^(e + S) in terms of A^s(e). 

Remark 5. Note that in this theorem we do not make any "compactness" or "total 
boundedness" assumptions about (A, d&) or {B,dB)- 

Remark 6. In general, in Theorem [4] we cannot expect to have any estimate for 
the supremum C just in terms of A^4(e). However we do have an estimate for C 
in the following particular case: Assume that A is an absolutely convex subset of a 
linear space V and that the semimetric dA defined as above also satisfies 



(9) 



oIa(cl, a') = p(a — a) 



for some seminorm ponV and all a, a' € A. Assume furthermore that the function 
h : A x B ^ C satisfies 



(10) 



h(0, b)=0 for all beB. 



In this case, from straightforward "geometric" considerations, one can expect the 
supremum C to satisfy 

(11) C<eN A (e), 

and we will prove in an appendix (Section[5]) that (jTTJ) indeed holds. Consequently, 
in this case, Theorem 2] gives us that 



(12) 



N£(e + 6) < 



~V2eN A (e) 



2N A (e) 
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and, if we also have that h is real valued, we can sharpen this to 

-eN A (e)^ NA(e) 



(13) 



N§(e + 8) < 



In our first application (Corollary [5]) of Theorem 2] to the context of Schauder's 
Theorem, the set A, the function h and the scmimetric cLa satisfy the conditions of 
this particular case. But of course there we have a simpler and, in general, better 
way to estimate C. We apparently cannot invoke an analogue of (flT]) in Corollary 

m 

Remark 7. One of the examples to be presented in Section [3] will show that the 
estimate ((8|) cannot be sharpened. On the other hand, it will be clear from the 
proof below that the estimates (|7|) and (fT2|) can be somewhat improved, either by 
making some small efforts in planar geometry or by consulting some appropriate 
references, in order to sharpen the claim (ii) in Fact 1 121 Another example in Section 
[3] will show that the requirement that p > e in part (ii) of Theorem [4] cannot be 
replaced by any weaker requirement of the form p > <p(e) for some function <f> of e 
which satisfies cf>(e) < e. 

Here is the obvious simplest way that we can apply Theorem 21 

Corollary 8. Let X and Y be Banach spaces and let Bx and By* be the closed 
unit balls of X and Y* respectively. 

Let T : X — > Y be a bounded linear operator with adjoint T* : Y* — > X* . 

For each e > let iVr(e) denote the least number of closed balls in Y of radius 
e with centres in T (Bx) which are required to cover the set T (B x ), and let Nft(e) 
denote the least number of subsets ofY each with Y -norm diameter not exceeding 
2e which are required to cover T (Bx). 

Analogously, let Nt*(c) denote the least number of closed balls in X* of radius e 
with centres in T* (By* ) which are required to cover the set T* (By* ) and let Nft, (e) 
denote the least number of subsets of X* each with X* -norm diameter not exceeding 
2e which are required to cover T* (By*). 

Suppose that iVy(e) is finite for some particular e > 0. Then N^* (p) is finite for 
all p > e and the estimate 



(14) 

holds for all S > 0. 
sharpened to 

(15) 



V2\\T\ 



N£. (e + S)< 



If X and Y are real Banach spaces, then this estimate can be 



Nfi, (e + S)< 



ITI 



1\ N T (e) 



Furthermore, if A^*(e) is finite for some e > 0, then N^(p) is finite for all 
p > e and the quantity (e + S) can be estimated in terms of Nt*(c) via formulae 
exactly analogous to {1$ and H15\) , where T and T* are interchanged. 



Apparently other results will give much better estimates than ([14]) and ([15]) . But 
here is a slightly more subtle variant of Corollary [5] for which, in some cases, our 
estimates are best possible. With the perspective of Theorem 0] we can see that 
it may be just as appropriate and just as easy to work with the covering numbers 
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of certain "significant" subsets of T (Bx) and of T* (By*), instead of working with 
the covering numbers of these sets themselves. We will obtain new versions of the 
estimates (|14p and (fT5)) for N^,(e + 6), which are stronger in the sense that the 
number iVr(e) is replaced by a smaller, in some cases very much smaller number, 
which is the covering number of a suitable subset K of T {Bx ) ■ Similarly the 
estimates for N^(e+S), which were stated implicitly in CorollaryEl can be replaced 
by stronger results where Nt* (e) is replaced by the covering number of a suitable 
subset K* of T* (By.). 

Corollary 9. Let X, Y , T, N^(e), and N^,(e) all be as specified in the statement 
of Corollary [3 Let K be a "norming" subset of T(Bx), i.e., a subset with the 
property that 

(16) sup{|(u, y)\ : u G K} — sup{|(u,y)| : u G T(B X )} for each y G F* . 
Analogously, let K* be a subset of T* (By. ) with the property that 

(17) sup{|(ar,i;)| : v G K*} = sup{|(x,u)| : v G T*(B Y ,)} for each x G X . 

For each e > let N[K,e] be the least number of closed balls in Y of radius e 
with centres in K which are required to cover the set K. Analogously, let N[K*,e] 
denote the least number of closed balls in X* of radius e with centres in K* which 
are required to cover the set K* . 

Suppose that N[K,e] is finite for some particular e > 0. Then N^*(p) is finite 
for all p > e and the estimate 

- _ n \ 2N\K,e] 

V2\\T\\ 



(18) 



N£, (e + S)< 



I X—>Y 



holds for all 5 > 0. If X and Y are real Banach spaces then this estimate can be 
sharpened to 

1 \ N\K,e] 

(19) N£,(e + 6) < ' 



\\T\ 



X^Y 



Analogously, if N[K* , e] is finite for some particular e > 0, then N^{p) is finite for 
all p > e and the estimate 

- _ -i \ 2N\K*,e\ 

^\\T\\ X ^Y 1 



(20) 



Nfi(e + 5) < 



holds for all S > 0, and in the case where X and Y are real Banach spaces, it can 
be sharpened to 

-| x N[K*,e] 

(21) Nfi{e + 5) < 



\X^Y 

5 



Remark 10. Obviously Corollary [8] is nothing more than a special case of Corollary 
M since of course the sets K = T{B X ) and K* = T* (By*) satisfy (fT6]) and |[T7|) . 
But it seems better and clearer to have begun this discussion by stating that special 
case separately. 

What reasonable kinds of sets might play the roles of K and of K* in Corollary 
W Obviously we can take K = {Tx : \\x\\ x = 1} and K* = {T*y : \\y\\ Y , = 1}. 
But for such choices we cannot expect N[K,e] and N[K*,e] to be substantially 
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smaller than, respectively, Nx(e) and Nx*(e). In finite dimensional spaces, if Bx 
is the convex hull of some finite set F, then K can be chosen to be T(F). (We 
are aware of at least two papers, namely [J] and [B], which could be applied to 
give connections between the covering numbers, or entropy numbers, of T(F) and 
of T (Bx) in such cases.) If T has some special additional properties it might be 
possible to choose an even smaller set than T(F) in the role of K, 

Here is what is probably the most natural example of a choice of K which satisfies 
p6p and for which N[K, e] is very significantly smaller than Nx(e). Let X and Y 
both be R" equipped with the i 1 norm and let T be the identity operator on R™. 
Let K be the subset of Bx which consists of the n points ej for j — 1, 2, n, where 

ei = (1, 0, 0, 0), e 2 = (0, 1, 0, 0, ....,0), , e„ = (0, 0, 0, 1). Of course N T (e) 

is arbitrarily large for small values of e. But N[K, e] = n for all e in the range 
< e < 1. Of course in (fl6|) we take (•, •) to be the usual inner product on R", and 
so X* and Y* are both R n equipped with the £°° norm. Clearly (fTl)]) holds here 
since, for each y £ R", both sides of (fl6|) equal ||y||^oo- 

Let us now use this example to show that in general the estimate (1191) cannot be 
improved. In our context here (cf. Fact [2]), for each p > 0, we see that Njt. (p) is the 
minimal number of closed cubes in R" of side length 2p required to cover all of the 
closed cube Q = [— 1, l] n . The interval [—1,1] is contained in the union of ^ non 

(r i \ n 
) ' 

By trivial considerations of volume, we must also have (2p) n N^, (p) > 2™. It follows 
that, for each positive integer to, we have Njt(p) = m n for all numbers p which 
satisfy m n — 1 < p~ n < to". In particular this will hold whenever we choose 
p = (to" - ey 1/n for some number 8 £ (0, 1). For our purposes we will choose a 
particular value of 9 £ (0, 1) which is sufficiently small to ensure that 



(22) to- 1< (to" -6) 1/n 



< TO . 



For each number e which is in the range < e < min we have 

N[K,e] — n and we also have that the number 8 = 6(e) which satisfies e + 8 = 
(to" - ey 1/n is positive. For each such number e and for each corresponding 6 = 
8(e), the left side of (fT9|) equals to" and the right side equals ^ 



( 



(m"-6»)- 1 /"-e 



l-e(r, 



8 

. Since we have chosen 8 to satisfy 



we can now choose e sufficiently small, so that the number fjw^s^fuT^ a l so li es 
in the open interval (m — 1, to). This makes the right side of (fiT)]) also equal to to". 
So indeed the estimate fl9]) is best possible for certain values of the numbers e and 
(5, and in fact for infinitely many such values, which can be taken arbitrarily small. 

Remark 11. If we change this example so that X = Y is R" equipped with the 
i°° norm, then we can of course take K to consist of the 2" vertices of the cube 
of side length 2 centred at the origin, i.e., the extreme points of Bx- Or K may 
consist merely of half of these points. Perhaps we will look at this example in more 
detail in a subsequent version of this paper. 
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2. Proofs 

2.1. The proof of Corollary [9] (and therefore also of Corollary [8| . 

It is rather obvious what needs to be done, even more so in the case where K = 
T(Bx) and K* = T* (By*). But let us write out the proof explicitly, at least in 
this preliminary version of our paper: 

Let A — {x£ Bx ■ Tx € K}. We clearly have T(A) = K. We also let B = By ■ 
Define h : A x B -> C by h(a, b) = (Ta, b). Then we have 

(23) sup \h{a,b)\<\\T\\ x ^ Y =\\T*\\ Y .^ x . , 

aeA,b£B 

and also, by the Hahn-Banach Theorem, 

(24) d,A(ai, a 2 ) = \\Tai — Ta 2 || y for all ai, a 2 € A . 

Let e be a positive number for which -/V[_ftT, e] is finite and equals n. Then there 
exists a set of n points {yi, y 2 , • y n } in if such that min {\\y — yj\\ Y '■ 1 < j < n} < 
e for each y £ K. Since K = T (A), we have a set of n points {x%, X2, a;„} in A 
such that Txj = yj for each j = 1, 2, n. 

For each a £ A we have Ta 6 if and therefore 

min {d^ (a, Xj) : 1 < j < n} — min { \\Ta — Txj\\ Y 
This shows that 

(25) N A (e) < N[K, e] for all e > . 

In order to obtain a formula for g?b, we use the fact that h(a,b) is also equal to 
(a, T*b). For each pair of elements di,^ € 5 we again use the fact that T(A) = K 
and we apply (|16|) to obtain that 

d>B(bi,b 2 ) = sup |(Ta, 6i - b 2 )\ = sup |(u, b\ - b 2 )\ = sup \(u,bi-b 2 )\ 

aeA ueK ueT(Bx) 

(26) = sup \(Tx,h - b 2 )\ = sup |(a;,r*(6i-6 2 ))| = \\T*h -T*b 2 \\ x . . 

xeBx x£B x 

Next we will show that 

(27) Nft, (p) < N% (p) for all p > . 

Let p be an arbitrary positive number. If m = Ng(p) then there exists a collection 
Wi, W 2 ,...., W m of m subsets of B = By such that sup b b , eW . d,B(b,b') < 2p for 
each j = 1, 2, m and B C Ujli Wj- Let V^- = T* (Wj) for each j. Each set Vj 
is contained in T* (By ) and, in view of (|26l) . 

(28) sup ||«-u'|| x . = sup ||T*6-T*6'|| X . = sup d B (b,b')<2p. 

v,v'<£Vj b,b'£Wj b,b'eWj 

We also have T* (By) C UJli ^ which, together with gives us (j27)) . 

Now to obtain ^TSJ and (O we simply have to substitute §Ff), and (JH]) in 
d?]) and ([H]) respectively. 

The rest of the proof, i.e., of the estimates (f2T))) and of (12~T]) . is a variant of the 
arguments to obtain (fTSj) and P^)l that we have just presented. For this last part 
of the proof we will choose A = {y £ By -T*y G K*} and i? = Bx and /i(a, 6) = 
(Tb,a) = (6, T*a), and then use reasoning which is almost exactly analogous to 
that used above, beginning with the observation that T*(A) = K* . We will only 
need to make two small and obvious changes: 
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• The formula for <1a, i.e., GU(ai,»2) — \\T*ai — T*a2\\ x *i whose analogue in 
(|24|) followed from the Hahn Banach Theorem, here is simply a consequence of the 
definition of the norm of a linear functional. 

• The last step in l|26p above simply used the definition of the norm of a linear 
functional. The analogous step here, which shows that ds (b%, 62) = \\Tbi — Tf^Hy, 
will use the Hahn-Banach theorem. 

We leave the details to the reader. □ 



V2C 



2.2. How to cover big intervals by small intervals, and big disks by small 
disks. 

This very simple geometrical observation will be one of the components in our 
proof of Theorem |4] 

Fact 12. Let 5 and C be numbers satisfying < 5 < C . 

(i) The interval [—C,C] can be contained in the union of [~S-1 closed intervals 
I q , g=l,2, ["SI , each of length 28. 

(ii) The closed disk {z G C : \z\ < C} can be contained in the union of 

r r I 2 

closed disks D q , q = 1, 2, each of radius 8. 

Proof. The claim (i) is trivial and needs no further comment. 

The number of disks mentioned in the claim (ii) can certainly be reduced. How- 
ever, in this preliminary version of the paper, we will not seek the optimal value of 
this quantity, but simply content ourselves with some quite crude, simple minded 
estimates. First we remark that each closed disc of radius 8 contains a closed square 
of side length y/25. If the positive integer k satisfies k > 2C j\[28 then we will be 
able to cover a closed square of side length 2C with k 2 closed squares of side length 
V2<5. So we will certainly be able to cover a closed disc of radius C with k 2 closed 

discs of radius 6, where k^l^Y. □ 



2.3. The proof of Theorem [H 

Throughout this proof the number e > 0, for which we suppose that A^i(e) is finite, 
will remain fixed, and n will always equal Na(c)- We also fix a finite n element 
subset F = {ai, et2, , a n } of A which has the property 

(29) min{dA(a,aj) 13 G {l,2,...,n}} <e. 

At least one such subset necessarily exists. 

C will always denote the supremum C := sup aeA beB \h(a,b)\. 

Our first task is to show that the supremum C is finite: Let a and a' be two 
arbitrary elements of A. Let Cf = HiaXj,fce{i,2 ,n} ^Aiflj, a>k)- Then, for each j 
and k in {1, 2, n} we have 

cU(a, a') < d A (a, a 3 ) + ^(a^, a k ) + d(a k ,a') . 

By making appropriate choices of j and k we obtain that 

(30) d A (a,a') < 2e + C F . 
Now, for arbitrary a € A and b G B, we have 

\h(a, 6)| < \h(a, b) - h( ai ,b)\ + \h( ai b)\ 
< dA(a, ai) + sup \h(cLx, b') \ . 

b'eB 
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In view of (|30|) and ((2]) this last expression is finite and does not depend on a or b. 
So our assumption that Na(c) is finite indeed ensures the finiteness also of C. 

Let 6 be an arbitrary positive number. It remains to show that Ng(e + S) is 
finite and satisfies the required estimates. 

It will be convenient to simultaneously consider both the general case and the 
special case where h is real valued. To facilitate this we will let K denote the real 
field R if h is real valued, and otherwise we will have K = C. 

Let us now consider the set G = {<?& : b £ B} of functions <?& : F — > K defined 
by the formula #&(/) = h(f,b). G is of course contained in the set Q C £°°(F) 
consisting of those functions g : F — > K for which \g(f)\ < C for each /. 



If IK = C then we let M 



V2C 

s 



and let D±, D2, , Dm be the M closed 

discs of radius S in the complex plane provided by Fact \12\ whose union contains 
the closed disk {z S C : \z\ < C}. For each n-tuple ft = Mn) 5 where each 

Hj is an integer in the range 1 < pj < M, let Q(pi, p.2, = Q (ft) be the set of 

all functions g : F — + C such that g(dj) E D llj for j = 1, 2, ...,n. Furthermore, let 
B(jii, /L«2, /in) = 5 (/2) be the set of all 6 €E B for which the function <?& defined 
as above is in Q([ii, jJb2, ^n)- 

The inclusion {z £ C : \z\ < C} C Ugli ^9 implies that G C Q (jl) which in 
turn gives us that B C [JpB (ft), i.e., that B is contained in the union of the M n 
sets B (ft) . 

Analogously, if K = R then we choose a different value for M, namely M = 

and we let I±, I 2 , , Im be the M closed intervals of length 28 provided 

by Fact [T2l whose union contains the closed interval [— C, C}. For each n-tuple 
ft = (/ii, fj,2i ••••! A*n)i where each /ij is an integer in the range 1 < fij < M, let 
Q(/Ui, fi2, Mn) = Q (ft) be the set of all functions ,g : F — > R such that <?(aj) S 
for j = 1, 2, n. Furthermore, let B(^i,fi2, M«) = B (P) be the set of all b e B 
for which the function gb defined as above is in Q(p\,fi2, ••••,/■'«)• As before, this 
time using the inclusion [— C, C] C Ug=i^9J we show that B is contained in the 
union of the M n sets B (ft), this time of course for this different choice of M. 

Now we shall estimate the diameter of each set B (ft) . We will use the same 
argument for both of the cases IK = C and K = R. Fix some ft = (/ii,/i2, ....,//„) 
and let b and b' be two arbitrary elements of B (ft) . Fix some arbitrary positive 
number p and let a be an element of A (possibly depending on p) for which 

d B (b,b')< \h(a,b)-h(a,b')\+p. 

Next, using (|29[) . we pick some aj £ F (possibly depending on a and therefore on 
p) for which d,A(a, aj) < e. Then we have 

\h(a, b) - h(a, b')\ < \h(a, b) - h(a h b)\ + \h( aj ,b) - h(a 3 ,b')\ + \h( aj ,b') - h(a, b')\ 
< e+\gb(a.j)- g b <(aj)\+e. 

By definition, when K = C the two numbers gb(cij) and gb'(dj) are either both in 
the same disk . of radius S , and when K = R they are both in the same interval 
I fJ _ j of length 25. So, for both K = C and K = R, we have 

\9b(a 3 ) - 9b'(aj)\ < 25 

and the previous three displayed estimates combine to give us that ds(b, b 1 ) < 2e + 
2(5 + p. Since b and b' are arbitrary and p can be chosen arbitrarily small, we obtain 
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that the diameter of B (p) does not exceed 2e + 28 and therefore Ng(e + 5) < M n , 
which is exactly (J7|) when K — C or exactly ([8]) when K = M. □ 

3. TWO MORE EXAMPLES 

The two examples which we present here apply to the rather more general context 
of Theorem |U rather than to Corollary [9] 

3.1. The estimate ([8]) in Theorem |4] is best possible. 

Our example in this subsection is a simpler version of the example discussed in 
Section [T] It shows (as also follows indirectly from the example of Section [1} that 
([8]) cannot be improved: 

Let to and n be positive integers. Let A be the set of n canonical vectors 

e h j = l,2,...,n in K", i.e., e x = (1, 0, 0, 0), e 2 = (0,1,0, ,0), , e n = 

(0, 0, 0, 0, 0, 1). Divide the cube [—1, 1]™ in R™ into to™ non overlapping closed 
cubes, each of side length 2/to. Let B be the set of to™ points b = (/3i,/?2, ....,(3 n ) 
which are the centres of these cubes. Let h(a, b) be the usual inner product of vectors 
in R™ . Thus d B (b, 9) = b-b' for each b and b' in B, and d A (e j ,e k )=2-2 Im 

whenever j ^ k. This means that Na{p) — n for all p G (0, 1 — 1/m). We also have 
sup SgA £ gB h(a, b) = 1 — 1/to. Since any set in W 1 whose £^ diameter is less than 

2/to can contain at most one of the points of B : we see that Ng(p) = m n for each 
p£ (0,1/m). 

Suppose that m > 2, and choose positive numbers e = and 5 = — — 2^2- 
Note that e + 5 < 1/to and e < 1 — 1/to. So, in this case, Ng(e + S) — m n and 
Na{z) = n. Furthermore the number 

C 1 - h to- 1 



6 JL - 1 1 _ J_ 

m 2m z 2m 



lies in the interval (to — l,m]. Consequently, ([§1)^^ — m " = Ng(e + 5) and 
in this case equality holds in ([8]). 



3.2. The statement (ii) in Theorem [4] is very close to best possible. 

Our example in this subsection addresses the statement (ii) in Theorem 0J It is 
natural to ask whether (ii) might remain true if we replace e in the requirement p > e 
by some smaller number. In order to show that (ii) cannot be sharpened in such a 
way, we will give an example of sets A and B and a function /i : AxB ^ I satisfying 
the hypotheses of Theorem [4j such that, when e = 1/2, we have A^(e) < 00, but 
Ng(p) — 00 for all p G (0, e). (Since we have not yet determined whether or not 
Ng(l/2) is finite, this does not settle the question of whether or not iV^e) < 00 
always implies that Ng(e) < 00.) 

In view of © , statement (ii) of Theorem U implies the following similar state- 
ment, expressed solely in terms of intrinsic covering numbers: 

(31) If N A (e) < 00 then N B (p) < 00 for all p > 2e . 

The same example which we are about to present will also show that the number 
2e in pip cannot be replaced by an smaller number. I.e., we can have Na(c) < 00 
but Ns(p) = 00 for all p G (0, 2e). Here again we will be choosing e = 1/2. (Since 
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in our example Nb(1) = 1, this does not settle the question of whether or not we 
can always replace p > 2e by p > 2e in ([31]) .) 
Here are the details of the example: 

We take A to be the set of all real valued sequences a — {a„} n£tj which take 
values in [0, 1]. We take B to be the set of all real valued sequences b = {Pn} ne jq 
which are finitely supported and satisfy X^^Li \Pn\ < 1- For each a = {a!„} n6N in A 
and each b = {/3 T i} nS N m let h(a, b) — Y^=t a n0 n - Then of course rf^(a — a') = 
|| a — a'l^oo for all a, a' 6 A, and, for each b = {/3 n }„ 6N and b' — {/3^} IlgN , we have 
d B (b, b') = max {JXi (Pn ~ f* n ) + , £~ x (P'n ~ 0n)+} ■ 

Let x be the sequence x = {Cn}„gN wnere £n = 1/2 for all n. Then (^(a, x) < 1/2 
for all a <E A. This means that N A (l/2) = 1 < oo. Since d B (b,b') < 2 for all 
6, 6' e _B, we see that N B (1) = 1. Since c?b(6, 0) < 1 for all b £ B, we also see that 
JVa(l) = 1. 

Now let us show that Nb(p) = oo whenever < p < 1: Suppose on the contrary 
that Nb{p) — m < oo for some p E (0, 1). Then there exist m sequences in B, 
which we will denote by bj — {/3j,n} neN for j = 1, 2, to, such that 

(32) min {d B (b, bj) : 1 < j < to} < p for all be B. 

Since each of the sequences bj is finitely supported, there exists an integer q such 
that (ij A = for j = 1,2, ...,to. Now let b — {/3„} neN be the sequence for which 
fig = 1 and j3 n = for all n ^ q. Then dB(b,bj) > 1 for all j, which contradicts 
(pl2"j) . Thus indeed, for each p e (0,1), we have Nb{p) — oo, and consequently 
(cf. ©) also Ng(p/2) — oo. In other words we also have Ng(p) = oo for each 
p £ (0, 1/2). This shows that the covering numbers Na, N b and ./Vb have all the 
properties which were promised at the beginning of this subsection. 

As indicated above, we would like to know whether or not No (1/2) < oo. 

4. Appendix: Further details about the case where A is absolutely 

convex 

Here is a proof of the estimate (| 1 1 1) which was mentioned in Remark [6j 

Let e be the positive number appearing in (jlip . let n = N A (e), and let F = 

{ai, d2, , a n } be the finite subset of A depending on e which is introduced at the 

beginning of the proof of Theorem |U If Tj — {a e A : dA{a, dj) < e} for each j, 

then (|2"5]) can be restated as Ac Uj=i ^j- 

Since h(0, b) = for &\\ b £ B and since dA is related to the seminorm p by |9]), 

we see that 

(33) C := sup \h(a,b)\ — sup | ft,(a, 6) — h(0, b) \ = sup dA (a, 0) = sup p(a) . 

Let a* be an arbitrary fixed element of A. The set L = {(1 — 2t)a^, : t £ [0, 1]}, 
i.e., the line segment in V from a* to —a*, is contained in A and therefore also in 
UjLi Tj- So the interval [0, 1] satisfies 

n 

(34) [0, 1] c |J Ij , 

3=1 

where the sets Ij are defined by 

/j ={t£ [0, 1] : (1 - 2t)a* e Tj} = {t e [0, 1] : p (a* - 2*a* - a^) < e} . 
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For each j the non negative function t i— ► p(a* — 2£a* — a,j) is continuous, and in 
fact it is also convex. So each Ij is a closed (possibly empty) subinterval of [0, 1]. 
It follows from ([51]) that at least one of these intervals, say I m , must have length 
no less than 1/n. Let s be a point in I m such that s + 1/n is also in I m . Then 

— p(a*) = p( — - ) = p ( (a* — 2sa* — a m ) — ( a* — 2 ( s H — ) a* — a 
n \ n J \ \ \ n) 

< p (a* — 25a* — a m ) + p ^a* — 2 ys H ^ a* — a m ^ < e + e . 

This shows that p(a*) < en. So now we simply take the supremum in this last 
inequality as a* ranges over A and use (|33p to obtain 
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